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ABSTRACT 

Objective To estimate the infection fatality rate of coronavirus disease 2019 (COVID-19) from 
data of seroprevalence studies. 

Methods Population studies with sample size of at least 500 and published as peer-reviewed 
papers or preprints as of July 11, 2020 were retrieved from PubMed, preprint servers, and 
communications with experts. Studies on blood donors were included, but studies on healthcare 
workers were excluded. The studies were assessed for design features and seroprevalence 
estimates. Infection fatality rate was estimated from each study dividing the number of COVID-19 
deaths at a relevant time point by the number of estimated people infected in each relevant region. 
Correction was also attempted accounting for the types of antibodies assessed. Secondarily, 
results from national studies were also examined from preliminary press releases and reports 
whenever a country had no other data presented in full papers of preprints. 

Results 36 studies (43 estimates) were identified with usable data to enter into calculations and 
another 7 preliminary national estimates were also considered for a total of 50 estimates. 
Seroprevalence estimates ranged from 0.222% to 47%. Infection fatality rates ranged from 0.00% 
to 1.63% and corrected values ranged from 0.00% to 1.31%. Across 32 different locations, the 
median infection fatality rate was 0.27% (corrected 0.24%). Most studies were done in pandemic 
epicenters with high death tolls. Median corrected IFR was 0.10% in locations with COVID-19 
population mortality rate less than the global average (<73 deaths per million as of July 12, 2020), 
0.27% in locations with 73-500 COVID-19 deaths per million, and 0.90% in locations exceeding 
500 COVID-19 deaths per million. Among people <70 years old, infection fatality rates ranged 
from 0.00% to 0.57% with median of 0.05% across the different locations (corrected median of 
0.04%). 
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Conclusions The infection fatality rate of COVID-19 can vary substantially across different 
locations and this may reflect differences in population age structure and case-mix of infected and 
deceased patients as well as multiple other factors. Estimates of infection fatality rates inferred 
from seroprevalence studies tend to be much lower than original speculations made in the early 
days of the pandemic. 
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The infection fatality rate (IFR), the probability of dying for a person who is infected, is 
one of the most critical and most contested features of the coronavirus disease 2019 (COVID-19) 
pandemic. The expected total mortality burden of COVID-19 is directly related to the IFR. 
Moreover, justification for various non-pharmacological public health interventions depends 
crucially on the IFR. Some aggressive interventions that potentially induce also more pronounced 
collateral harms^ may be considered appropriate, if IFR is high. Conversely, the same measures 
may fall short of acceptable risk-benefit thresholds, if the IFR is low. 

Early data from China, adopted also by the World Health Organization (WHO),^ focused 
on a crude case fatality rate (CFR) of 3.4%; CFR is the ratio of COVID-19 deaths divided by the 
number of documented cases, i.e. patients with symptoms who were tested and found to be PCR- 
positive for the virus. The WHO envoy who visited China also conveyed the message that there 
are hardly any asymptomatic infections.^ With a dearth of asymptomatic infections, the CFR 
approximates the IFR. Other mathematical models suggested that 40-70%,^ or even^ 81% of the 
global population would be infected. Influential mathematical models^’^ eventually dialed back to 
an IFR of 1.0% or 0.9%, and these numbers long continued to be widely cited and used in both 
public and scientific circles. The most influential of these models, constructed by Imperial College 
estimated 2.2 million deaths in the USA and over half a million deaths in the UK in the absence of 
lockdown measures.^ Such grave predictions justifiably led to lockdown measures adopted in 
many countries. With 0.9% assumed infection fatality rate and 81% assumed proportion of people 
infected, the prediction would correspond to a global number of deaths comparable with the 1918 
influenza, in the range of 50 million fatalities. 

Since late March 2020, many studies have tried to estimate the extend of spread of the 
virus in various locations by evaluating the seroprevalence, i.e. how many people in population 
samples have developed antibodies for the virus. These studies can be useful because they may 
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inform about the extend of under-ascertainment of documenting the infection based on PCR 
testing. Moreover, they can help obtain estimates about the IFR, since one can divide the number 
of observed deaths by the estimated number of people who are inferred to have been infected. 

At the same time, seroprevalence studies may have several caveats in their design, 
conduct, and analysis that may affect their results and their interpretation. Here, data available as 
of July 11, 2020 were collected, scrutinized, and used to infer estimates of IFR in different 
locations where these studies have been conducted. 

METHODS 
Seroprevalence studies 

The input data for the calculations of IFR presented here are studies of seroprevalence of 
COVID-19 that have been done in the general population, or in samples that might approximate 
the general population (e.g. with proper reweighting) and that have been published in peer- 
reviewed journals or have been submitted as preprints as of July 11, 2020. Only studies with at 
least 500 assessed samples were considered, since smaller datasets would entail extremely large 
uncertainty for any calculations to be based on them. When studies focused on making 
seroprevalence assessments at different time interval, they were eligible if at least one time 
interval assessment had a sample size of at least 500 participants; among different eligible time 
points, the one with the highest seroprevalence was selected, since seroprevalence may decrease 
over time as antibody titers wane. Studies with data collected over more than a month, and that 
could not be broken into at least one eligible time interval that did not exceed one month in 
duration were excluded, since it would not be possible to estimate a point seroprevalence with any 
reliability. Studies were eligible regardless of the exact age range of included participants, but 
studies including only children were excluded. 
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Studies where results were only released through press releases were not eonsidered here, 
since it is very difficult to tell much about their design and analysis, and this is fundamental in 
making any inferences based on their results. Nevertheless, secondarily, results from national 
studies were also examined from preliminary press releases and reports whenever a country had 
no other data presented in full papers of preprints as of July 11, 2020. This allowed these countries 
to be represented in the collected data, but extra caution is required given the preliminary nature of 
this information. Preprints should also be seen with caution since they have not been yet fully 
peer-reviewed (although some of them have already been revised based on very extensive 
comments from the scientific community). However, in contrast to press releases, preprints 
typically offer at least a fairly complete paper with information about design and analysis. 

Studies done of blood donors were eligible, although it is possible they may underestimate 
seroprevalence and overestimate IFR due to healthy volunteer effect. Studies done on health care 
workers were not eligible, since they deal with a group at potentially high exposure risk which 
may lead to seroprevalence estimates much higher than the general population and thus 
implausibly low IFR. For a similar reason, studies focused on communities (e.g. shelters or 
religious or other shared-living communities) were also excluded. Studies were eligible regardless 
of whether they aimed to evaluate seroprevalence in large or small regions, provided that the 
population of reference in the region was at least 5000 people. 

Searches were made in PubMed (LitCOVID), medRxiv, bioRxiv, and Research Square 
using the terms “seroprevalence” and “antibodies” with continuous updates (last update July 11, 
2020). Communication with colleagues who are field experts sought to ascertain if any major 
studies might have been missed. 

Information was extracted from each study on location, recruitment and sampling strategy, 
dates of sample collection, sample size, types of antibody used (IgG, IgM, IgA), estimated crude 
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seroprevalence (positive samples divided by all samples test), and adjusted seroprevalence and 
features that were considered in the adjustment (sampling process, test performance, presence of 
symptoms, other). 

Calculation of inferred IFR 

Information on the population of the relevant location was collected from the papers. 
Whenever it was missing, it was derived based on recent census data trying to approximate as 
much as possible the relevant catchment area (e.g. region(s) or county(ies)), whenever the study 
did not pertain to an entire country. Some studies targeted specific age groups (e.g. excluding 
elderly people and/or excluding children) and some of them made inferences on number of people 
infected in the population based on specific age groups. For consistency, the entire population, as 
well as, separately, only the population with age <70 years were used for estimating the number of 
infected people. It was assumed that the seroprevalence would be similar in different age groups, 
but significant differences in seroprevalence according to age strata that had been noted by the 
original authors were also recorded to examine the validity of this assumption. 

The number of infected people was calculated multiplying the relevant population with the 
adjusted estimate of seroprevalence. Whenever an adjusted seroprevalence estimate had not been 
obtained, the unadjusted seroprevalence was used instead. When seroprevalence estimates with 
different adjustments were available, the analysis with maximal adjustment was selected. 

For the number of COVID-19 deaths, the number of deaths recorded at the time chosen by 
the authors of each study was selected, whenever the authors used such a death count up to a 
specific date to make inferences themselves. If the choice of date had not been done by the 
authors, the number of deaths accumulated until after 1 week of the mid-point of the study period 
was chosen. This accounts for the differential delay in developing antibodies versus dying from 
the infection. It should be acknowledged that this is an averaging approximation, because some 
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patients may die very soon (within <3 weeks) after infeetion (and thus are overeounted), and 
others may die very late (and thus are undercounted due to right censoring). 

The inferred IFR was obtained by dividing the number of deaths by the number of infected 
people for the entire population, and separately for people <70 years old. The proportion of 
COVID-19 deaths that occurred in people <70 years old was retrieved from situational reports for 
the respective countries, regions, or counties in searches done in June 3-7 for studies published 
until June 7 and in July 3-11 for studies published later. A corrected IFR is also presented, trying 
to account for the fact that only one or two types of antibodies (among IgG, IgM, IgA) might have 
been used. Correcting seroprevalence upwards (and inferred IFR downwards) by 1.1-fold for not 
performing IgM measurements and similarly for not performing IgA measurements may be 
reasonable, based on some early evidence,^ although there is uncertainty about the exact correction 
factor. 

Data synthesis considerations 

Inspection of the IFR estimates across all locations showed vast heterogeneity with 
heterogeneity I^ exceeding 99.9% and thus a meta-analysis would be inappropriate to report across 
all locations. Quantitative synthesis with meta-analysis across all locations would also be 
misleading since locations with high seroprevalence would tend to carry more weight than 
locations with low seroprevalence; locations with more studies (typically those that have attracted 
more attention because of high death tolls and thus high IFRs) would be represented multiple 
times in the calculations; and more sloppy studies with fewer adjustments would get more weight, 
because they would have spuriously tighter confidence intervals than more rigorous studies with 
more careful adjustments allowing for more uncertainty. Finally, with a highly skewed IFR 
distribution and with extreme between-study heterogeneity, synthesis with a typical random 
effects model would tend to produce an erroneously high summary IFR that approximates the 
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mean of the study-specific estimates (also heavily driven by hotbed high-mortality locations with 
more studies done), while for a skewed distribution the median is more appropriate. 

Therefore, at a first step, IFR estimates from studies done in the same country (or in the 
US, the same state) were grouped together and a single IFR was obtained for that location, 
weighting the study-specific IFRs by the sample size of each study. This allowed to avoid giving 
inappropriately more weight to studies with higher seroprevalence estimates and those with 
seemingly tighter confidence intervals because of poor or no adjustments, while still giving more 
weight to larger studies. Then, a single summary estimate was used for each location and the 
median of the distribution of location-specific IFR estimates was calculated. Finally, it was 
explored whether the location-specific IFRs were associated with the COVID-I9 mortality rate in 
the population (COVID-19 deaths per million people) in each location as of July 12, 2020; this 
allowed to assess whether IFR estimates tend to be higher in harder hit locations. 

RESULTS 

Seroprevalence studies 

36 studies with a total of 43 eligible estimates were published either in the peer-reviewed 
literature or as preprints as of July 11, 2020.*'^^ Dates and processes of sampling and recruitment 
are summarized in Table 1, sample sizes, antibody types assessed and regional population appear 
in Table 2, estimated prevalence, and number of people infected in the study region are 
summarized in Table 3, and number of COVID-19 and inferred IFR estimates are found in Table 
4. Several studies performed repeated seroprevalence surveys at different time points, and only the 
time point with the highest seroprevalence estimate is considered in the calculations. With three 
exceptions, this is also the latest time point. Furthermore, another 7 preliminary national estimates 
were also considered (Table 5)"^'^'^° from countries that had no other seroprevalence study 
published as a full paper or preprint. This yielded a total of 50 eligible estimates. 
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At least seven studies found some statistieally significant, modest differences in 
seroprevalence rates across some age groups (Oise: decreased seroprevalence in age 0-14, 
increased in age 15-17; Geneva: decreased seroprevalence in age >50; Netherlands: increased 
seroprevalence in age 18-30; New York state: decreased seroprevalence in age >55; Brooklyn: 
decreased seroprevalence in age 0-5, increased in age 16-20; Tokyo: increased seroprevalence in 
age 18-34, Spain: decreased seroprevalence in age 0-10, Belgium: higher seroprevalence in age 
>90). The patterns are not strong enough to suggest major differences in extrapolating across age 
groups, although higher values in adolescents and young adults and lower values in children 
cannot be excluded. 

As shown in Table 1, these studies varied substantially in sampling and recruitment 
designs. The main issue is whether they can offer a representative picture of the population in the 
region where they are performed. A generic problem is that vulnerable people who are at high risk 
of infection and/or death may be more difficult to recruit in survey-type studies. COVID-19 
infection seems to be particularly widespread and/or lethal in nursing homes, among homeless 
people, in prisons, and in disadvantaged minorities. Most of these populations are very difficult, or 
even impossible to reach and sample from and they are probably under-represented to various 
degrees (or even entirely missed) in surveys. This would result in an underestimation of 
seroprevalence and thus overestimation of IFR. Eleven of the 36 studies that are available as full 
papers (Iran,^ Geneva,Gangelt,'^ Rio Grande do Sul,*^ Luxembourg,^° Los Angeles county, 
three Brazil studies,^^’^^’^^, Spain,^^ and Louisiana^^) explicitly aimed for random sampling from 
the general population. In principle, this is a stronger design. However, even with such designs, 
people who cannot be reached (e.g. by e-mail or phone or even visiting them at a house location) 
will not be recruited, and these vulnerable populations are likely to be missed. Moreover, 5 of 
these 11 studies^’*'*’^^’'^^’^^ focused on studying geographical locations that had extreme numbers of 
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deaths, higher than other locations in the same city or country, and this would tend to select 
eventually for higher IFR on average. 

Seven studies assessed blood donors in Denmark,*^ Netherlands,*^ Scotland,** the Bay 
Area in California,^"* Zurich/Luceme,^* Apulia** and Rio De Janeiro."** By definition these studies 
include people in good health and without symptoms, at least recently, and therefore may 
markedly underestimate COVID-19 seroprevalence in the general population. A small set of 200 
blood donors in Oise, France** showed 3% seroprevalence, while pupils, siblings, parents, 
teachings and staff at a high school with a cluster of cases in the same area had 25.9% 
seroprevalence; true population seroprevalence may be between these two values. 

For the other studies, healthy volunteer bias may lead to underestimating seroprevalence 
and this is likely to have been the case in at least one case (the Santa Clara study)*^ where wealthy 
healthy people were rapidly interested to be recruited when the recruiting Facebook ad was 
released. The design of the study anticipated correction with adjustment of the sampling weights 
by zip code, gender, and ethnicity, but it is likely that healthy volunteer bias may still have led to 
some underestimation of seroprevalence. Conversely, attracting individuals who might have been 
concerned of having been infected (e.g. because they had symptoms) may lead to overestimation 
of seroprevalence in surveys. Finally studies of employees, grocery store clients, or patient cohorts 
(e.g. hospitalized for other reasons, or coming to the emergency room, or studies using residual 
lab samples) may have sampling bias with unpredictable direction. 

As shown in Table 2, all studies have tested for IgG antibodies, but only about half have 
also assessed IgM, 4 have assessed IgA. Only three studies assessed all three types of antibodies 
and one more used a pan-Ig antibody. Studies typically considered the results to be “positive” if 
any tested antibody type was positive, but one study (Luxembourg) that considered the results to 
be “positive” only if both IgG and IgA were detected. The ratio of people sampled versus the total 
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population of the region was better than 1:1000 in 11 studies (Idaho,^ Denmark blood donors,*^ 
Gangelt,^^ Santa Clara,Luxembourg,^® Brooklyn,^’ Zurich,^^ San Franeiseo,^^ Espirito Santo,^^ 
Spain,^® and Vitaoura"^^). 

Seroprevalence estimates 

As shown in Table 3, prevalenee ranged from as little as 0.222% to as high as 47%. 

Studies varied a lot on whether they tried or not to adjust their estimates for test performance, 
sampling (striving to get closer to a more representative sample), and clustering effects (e.g. when 
including same household members) as well as other factors. The adjusted seroprevalence 
occasionally differed substantially from the crude, unadjusted value. In principle adjusted values 
are likely to be closer to the true estimate, but the exercise shows that each study alone may have 
some unavoidable uncertainty and fluctuation, depending on the analytical choices preferred. In 
studies that sampled people from multiple locations, large between-location heterogeneity could 
be seen (e.g. 0-25% across 133 Brazilian cities). 

Inferred IFR 

Inferred IFR estimates varied a lot, from 0.00% to 1.63%. Corrected values also varied 
extensively, from 0.00% to 1.31%. For 10 locations, more than one IFR estimate was available 
and thus IFR from different studies evaluating the same location could be compared. As shown in 
figure 1, the IFR estimates tended to be more homogeneous within each location, while they 
differed a lot across locations. The sample size-weighted summary was used to generate a single 
estimate to represent each location. Data were available for 32 different locations. The median IFR 
across all 32 locations was 0.27% (0.24% using the corrected values). Most data came from 
locations with high death tolls and 23 of the 32 locations had a population mortality rate (deaths 
per million population) higher than the global average (73 deaths per million population as of July 
12) (Figure 2). The uncorrected IFR estimates had a range of 0.01-0.16% (median 0.13%) across 
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the 9 locations with population mortality rate below the global average, 0.07-0.73% (median 
0.27%) across the 15 locations with population mortality rate above the global average but below 
500 deaths per million population, and 0.59-1.63% (median 1.12%) across the 8 extreme hotbed 
locations with over 500 deaths per million population. The corrected IFR estimates had medians 
of 0.10%, 0.25%, and 0.90%, respectively, for the three groups of locations. 

The proportion of COVID-19 deaths that occurred in people <70 years old varied 
substantially across locations. All deaths in Gangelt were in elderly people while in Wuhan half 
the deaths occurred in people <70 years old and the proportion might have been higher in Iran, but 
no data could be retrieved for this country. When limited to people <70 years old, IFR ranged 
from 0.00% to 0.57% with median of 0.05% (corrected, 0.00-0.46% with median of 0.04%). IFR 
estimates in people <70 years old were lower than 0.1% in all but 7 locations that were hard-hit 
hotbeds (Belgium, Wuhan, Italy, Spain, Connecticut, Louisiana, New York). 

DISCUSSION 

IFR is not a fixed physical constant and it can vary substantially across locations, 
depending on the population structure, the case-mix of infected and deceased individuals and 
other, local factors. Inferred IFR values based on emerging seroprevalence studies typically show 
a much lower fatality than initially speculated in the earlier days of the pandemic. 

The studies analyzed here represent 50 different estimates of IFR, but they are not fully 
representative of all countries and locations around the world. Most of them come from locations 
with overall COVID-19 mortality rates exceeding the global average (73 deaths per million people 
as of July 12). The median inferred IFR in locations with COVID-19 mortality rate below the 
global average is low (0.13%, corrected 0.10%). For hotbed countries with COVID-19 mortality 
rates above the global average but lower than 500 deaths per million, the median IFR is still not 
that high (median 0.27%, corrected 0.25%). Very high IFR estimates have been documented 
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practically in locations that had devastating experiences with COVID-19. Such epicenters are 
unusual across the globe, but they are overrepresented in the 50 seroprevalence estimates available 
for this analysis. Therefore, if one could sample equally from all countries and locations around 
the globe, the median IFR might be even lower than the one observed in the current analysis. 

Several studies in hard-hit European countries inferred modestly high IFR estimates for the 
overall population, but the IFR was still low in people <70 years old. Some of these studies were 
on blood donors and may have underestimated seroprevalence and overestimated IFR. One study 
in Germany aimed to test the entire population of a city and thus selection bias is minimal: 
Gangelt^® represents a situation with a superspreader event (in a local carnival) and 7 deaths were 
recorded, all of them in very elderly individuals (average age 81, sd 3.5). COVID-19 has a very 
steep age gradient of death risk.^* It is expeeted therefore that in locations where the infection 
finds its way into killing predominantly elderly citizens, the overall, age-unadjusted IFR would be 
higher. However, IFR would still be very low in people <70 in these loeations, e.g. in Gangelt IFR 
is 0.00% in non-elderly people. Similarly, in Switzerland, 69% of deaths occurred in people >80 
years old^* and this explains the relatively high overall IFR in Geneva and Zurich. Similar to 
Germany, very few deaths in Switzerland have been recorded in non-elderly people, e.g. only 
2.5% have oecurred in people <60 years old and IFR in that age-group would be -0.01%. The 
majority of deaths in most of the hard hit European countries have happened in nursing homes^^ 
and a large proportion of deaths also in the US^^ also follow this pattern. Moreover, many nursing 
home deaths have no laboratory confirmation and thus should be seen with extra caution in terms 
of the causal impact of SARS-CoV-2. 

Focations with high burdens of nursing home deaths may have high IFR estimates, but the 
IFR would still be very low among non-elderly, non-debilitated people. The average length of stay 
in a nursing home is slightly more than 2 years and people who die in nursing homes die in a 
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median of 5 months^"^ so many COVID-19 nursing home deaths may have happened in people 
with life expectancy of only a few months. This needs to be verified in careful assessments of 
COVID-19 outbreaks in nursing homes with detailed risk profiling of fatalities. If COVID-19 
happened in patients with very limited life expectancy, this pattern may even create a dent of less 
than expected mortality in the next 3-6 months after the coronavirus excess mortality wave. As of 
July 12 (week 28), preliminary Euromonitor data^^ indeed already show a substantial dent below 
baseline mortality in France, and a dent below baseline mortality is seen also for the aggregate 
European data. 

Within China, the much higher IFR estimates in Wuhan versus other areas may reflect the 
wide spread of the infection to hospital personnel and the substantial contribution of nosocomial 
infections to a higher death toll in Wuhan;^^ plus unfamiliarity with how to deal with the infection 
in the first location where COVID-19 arose. Massive deaths of elderly individuals in nursing 
homes, nosocomial infections, and overwhelmed hospitals may also explain the very high fatality 
in specific locations in Italy^^ and in New York and neighboring states. Seroprevalence studies in 
health care workers and administrative hospital staff in Eombardy^^ found 8% seroprevalence in 
Milan hospitals and 35-43% in Bergamo hospitals, supporting the scenario for widespread 
nosocomial infections among vulnerable patients. The high IFR values in New York metropolitan 
area and neighboring states are also not surprising, given the vast death toll witnessed. A very 
unfortunate decision of the several state governors was to have COVID-19 patients sent to nursing 
homes. Moreover, some hospitals in New York City hotspots reached maximum capacity and 
perhaps could not offer optimal care. Use of unnecessarily aggressive management (e.g. 
mechanical ventilation) and hydroxychloroquine may also have contributed to worse outcomes. 
Furthermore, New York City has an extremely busy, congested public transport system that may 
have exposed large segments of the population to high infectious load in close contact 
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transmission and, thus, perhaps more severe disease. A more aggressive viral elade has also been 
speculated, but this needs further verification.^^ 

IFR may reach very high levels among disadvantaged populations and settings that have 
the worst combination of factors predisposing to higher fatalities. Importantly, such hotspot 
locations are rather uncommon exceptions in the global landscape. Moreover, even in these 
locations, the IFR for non-elderly individuals without predisposing conditions may remain very 
low. E.g. in New York City only 0.65% of all deaths happened in people <65 years without major 
underlying conditions.^* Thus the IFR even in New York City would probably be lower than 
0.01% in these people. 

Studies with extremely low inferred IFR are also worthwhile discussing. Possible 
overestimation of seroprevalence and undercounting of deaths need to be considered. E.g., for 
Kobe, the authors of the study** raise the question whether COVID-19 deaths have been 
undercounted in Japan. Both undercounting and overcounting of COVID-19 deaths may be a 
caveat in different locations and this is difficult to settle in the absence of very careful scrutiny of 
medical records and autopsies. The Tokyo data,^^ nevertheless, also show similarly very low IFR. 
Moreover, evaluation of all-cause mortality in Japan has shown no excess deaths during the 
pandemic, consistent with the possibility that somehow the Japanese population was spared. Very 
low IFRs seem common in Asian countries, including China (excluding Wuhan), Iran, Israel and 
India. Former immunity from exposure to other coronaviruses, genetic differences, hygienic 
etiquette, lower infectious load, and other unknown factors may be speculated. IFR seems to be 
very low also in Singapore where extensive PCR testing was carried out. As of July 12, 2020, in 
Singapore there were only 26 deaths among 46,283 cases, suggesting an upper bound of 0.06% for 
IFR, even if no cases had been missed. 
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Some surveys have also been designed to assess seroprevalenee repeatedly spaeing out 
measurements in the same population over time. A typical pattern that seems to emerge is that 
seroprevalenee may increase several fold within a few weeks, but plateau or even decline may 
follow.A more prominent decline of seropositivity was seen in a study in Wuhan.Genuine 
decrease may be difficult to differentiate from random variation. However, some preliminary 
data^O’^i suggest that decrease in antibody titers may be fast. Decrease in seropositivity over time 
means that the numbers of infected people may be underestimated and IFR overestimated. 

The only data from a low-income country among the 23 studies examined here come from 
Iran* and India"^^ and the IFR estimates appears to be very low. Iran has a young population with 
only slightly over 1% of the age pyramid at age >80 and India’s population is even younger. 
Similar considerations apply to almost every less developed country around the world. Given the 
very sharp age gradient and the sparing of children and young adults from death by COVID-19, 
one may expect IFR to be fairly low in the less developed countries. However, it remains to be 
seen whether comorbidities, poverty and frailty (e.g. malnutrition) and congested urban living 
circumstances may have adverse impact on risk and thus increase IFR also in these countries. 

One should caution that the extent of validation of the antibody assays against positive and 
negative controls differs across studies. Specificity has typically exceeded 99.0%, which is 
reassuring. However, for very low prevalence rates, even 99% specificity may be problematic. 
Sensitivity also varies from 60-100% in different validation exercises and for different tests, but 
typically it is closer to the upper than the lower bound. One caveat about sensitivity is that 
typically the positive controls are patients who had symptoms and thus were tested and found to 
be PGR-positive. However, it is possible that symptomatic patients may be more likely to develop 
antibodies than patients who are asymptomatic or have minimal symptoms and thus had not 
sought PGR testing.For example, one study found that 40% of asymptomatic patients became 
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seronegative within 8 weeks.Sinee the seroprevalence studies specifieally try to unearth these 
asymptomatic/mildly symptomatic missed infections, a lower sensitivity for these mild infections 
could translate to substantial underestimates of the number of infected people and substantial 
overestimate of the inferred IFR. 

The corrected IFR estimates are trying to account for undercounting of infected people 
when not all 3 antibodies (IgG, IgM, and IgA) are assessed.^ However, the magnitude of the 
correction is uncertain and may also vary in different circumstances. Moreover, it is possible that 
an unknown proportion of people may have handled the virus using immune mechanisms 
(mucosal, innate, cellular) that did not generate any serum antibodies.This may lead to 
substantial underestimation of the frequency of infection and respective overestimation of the IFR. 
One study has found indeed that mild SARS-CoV-2 infections may lead to nasal release of IgA, 
without serum antibody response.^* Another study has found that 6 of 8 interfamilial contacts of 
index cases remained seronegative despite developing symptoms and 6 of 8 developed persisting 
T cell responses^^ and the important role of cellular immune responses even in seronegative 
patients has been documented also by other investigators.’® 

An interesting observation is that even under congested circumstances, like cruise ships, 
aircraft carriers or homeless shelter, the proportion of people detected positive typically does not 
get to exceed 20-45%.’*’’^ Similarly, at a wider population level, values -47% are the maximum 
values documented to-date and most values are much lower, yet epidemic waves seem to wane. It 
has been suggested’^’’"^ that differences in host susceptibility and behavior can result in herd 
immunity at much lower prevalence of infection in the population than originally expected. 
COVID-19 spreads by infecting certain groups more than others because some people have much 
higher likelihood of exposure. People most likely to be exposed also tend to be those most likely 
to spread for the same reasons that put them at high exposure risk. In the absence of random 
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mixing of people, the epidemic wave may be extinguished even with relatively low proportions of 
people becoming infected. Seasonality may also play a role in the dissipation of the epidemic 
wave. It has also been observed that many people have CD4 cellular responses to SARS-CoV-2 
even without being exposed to this virus, perhaps due to prior exposure to other coronaviruses.^^ It 
is unknown whether this proportion varies in different populations around the world and whether 
this immunity may contribute to SARS-CoV-2 epidemic waves waning without infecting a large 
share of the population. 

A major limitation of the current analysis is that the calculations presented in this paper 
include several studies that have not yet been fully peer-reviewed. Moreover, there are several 
studies that are still ongoing. New emerging data may offer more insights and updated estimates. 
Given that the large majority of studies have been done in locations that were hard hit from 
COVID-19, it would be useful to do more studies in less hit locations, so as to have a more 
balanced global perspective. 

A comparison of COVID-19 to influenza is often attempted, but many are confused by this 
comparison unless placed in context. Based on the IFR estimates obtained here, COVID-19 may 
have infected as of July 12 approximately 300 million people (or more), far more than the ~13 
million PCR-documented cases. The global COVID-19 death toll is still evolving, but it is still not 
much dissimilar to a typical death toll from seasonal influenza (290,000-650,000),’^ while “bad” 
influenza years (e.g. 1957-9 and 1968-70) have been associated with 1-4 million deaths.” 

Notably, influenza devastates low-income countries, but is more tolerant of wealthy nations, 
probably because of the availability and wider use of vaccination in these countries.Conversely, 
in the absence of vaccine and with a clear preference for elderly debilitated individuals, COVID- 
19 may have an inverse death toll profile, with more deaths in wealthy nations than in low-income 
countries. However, even in the wealthy nations, COVID-19 seems to affect predominantly the 
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frail, the disadvantaged, and the marginalized - as shown by high rates of infeetious burden in 
nursing homes, homeless shelters, prisons, meat proeessing plants, and the strong raeial/ethnic 
inequalities against minorities in terms of the eumulative death riskJ*’^^ 

While COVID-19 is a formidable threat, the fact that its IFR is typically much lower than 
originally feared, is a welcome piece of evidence. The median IFR found in this analysis is very 
similar to the estimate recently adopted by CDC for planning purposes.The fact that IFR can 
vary substantially also based on case-mix and settings involved also creates additional ground for 
evidence-based, more precise management strategies. Decision-makers can use measures that will 
try to avert having this lethal virus infect people and settings who are at high risk of severe 
outcomes. These measures may be possible to be more precise and tailored to specific high-risk 
individuals and settings than blind lockdown of the entire society. Of course, uncertainty remains 
about the future evolution of the pandemic, e.g. the presence and height of subsequent waves. 
However, it is helpful to know that SARS-CoV-2 has relatively modest IFR overall and that 
possibly IFR can be made even lower with appropriate, precise non-pharmacological choices. 
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Table 1. Seroprevalence studies on COVID-19 published or depositing preprints as of July 
11, 2020: dates, sampling and recruitment process 


Location 

Iran (Guilan)* 

Idaho (Boise)^ 

Switzerland (Geneva)'*’ 


Japan (Kobe)" 


Denmark blood donors'^ 


France (Oise)'^ 


Dates 

April (until April 

21 ) 

Late April 

April 6-May 9 (5 

consecutive 

weeks) 


March 31-April 7 


April 6-May 3 


March 30-April 4 


Sampling and recruitment 

Population-based cluster random sampling design 
through phone call invitation, household-based. 
People from the Boise, Idaho metropolitan area, 
part of the Crush the Curve initiative. 

Randomly selected previous participants of the 
Bus Saute study with an email (or phone contact, 
if e-mail unavailable); participants were invited to 
bring all members of their household, aged 5 
years and older. 

Randomly selected patients who visited outpatient 
clinics and received blood testing for any reason. 
Patients who visited the emergency department or 
the designated fever consultation service were 
excluded. 

All Danish blood donors aged 17-69 years giving 
blood. Blood donors are healthy and must comply 
with strict eligibility criteria; they must self-defer 
for two weeks if they develop fever with upper 
respiratory symptoms. 

Pupils, their parents and siblings, as well as 
teachers and non-teaching staff of a high-school. 
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China (Wuhan)'^ 

Netherlands blood 
donors*^ 

Germany (Gangelt)*^ 

Brazil (Rio Grande do 
Sul)'’ 

Scotland blood donors'^ 

California (Santa Clara) 

Luxembourg^® 


April 3-15 


April 1-15 


March 30-April 6 


May 9-11 (third 
round, after April 
11-13, and 25-27) 
March 21-23 


April 2-3 


April 16-May 5 


People applying for a permission of resume 
(n= 1,021) and hospitalized patients during April 3 
to 15(n=381). 

Blood donors. Donors must be completely 
healthy, but they may have been ill in the past, 
provided that they recovered at least two weeks 
before. 

600 adult persons with different surnames in 
Gangelt were randomly selected, and 
all household members were asked to participate 
in the study. 

Multi-stage probability sampling was used in each 
of 9 cities to select 500 households, within which 
one resident was randomly chosen for testing. 
Blood donors. Donors should not have felt unwell 
in the last 14 days, also some other deferrals 
applied regarding travel and COVID-19 
symptoms. 

Facebook ad with additional targeting by zip 
code. 

Representative sample (no details how ensured), 
1807 of 2000 contacted provided data, were <79 
years and had serology results. 
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Germany (Frankfurt)^' 

California (Los Angeles)^^ 


New York^^ 


California (Bay Area)^'^ 

Brazil^^ 


Croatia^^ 

New York (Brooklyn)^^ 
Switzerland (Zurich)^^ 


April 6-14 


April 10-14 


April 19-28 


March 


May 15-22 


April 23-28 

Early May 
Prepandemic until 
June (patients) 


Employees of Infraserv Hochst, a large industrial 
site operator in Erankfurt am Main. No exclusion 
criteria. 

Proprietary database representative of the 
county. A random sample of these residents was 
invited, with quotas for enrollment for subgroups 
based on age, sex, race, and ethnicity distribution. 
Convenience sample of patrons >18 years and 
residing in New York State, recruited 
consecutively upon entering 99 grocery stores and 
via an in-store flyer. 

1,000 blood donors in diverse Bay Area locations 
(excluding those with self-reported symptoms or 
abnormal vital signs) 

Sampling from 133 cities (the main city in each 
region), selecting 25 census tracts with probability 
proportionate to size in each sentinel city, and 10 
households at random in each tract. Aiming for 
250 participants per city. 

DIV factory workers in Split and Sibenik-Knin 
invited for voluntary testing 
Patients seen in urgent care facility in Brooklyn 
Patients at the University Hospital of Zurich and 
blood donors in Zurich and Eucerne 
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Japan (Tokyo) 

and May (blood 

donors) 

April 21-May 20 

Two community clinics located in the major 

Spain (Barcelona)^° 

April 14-May 5 

railway stations in Tokyo (Navitas Clinic 

Shinjuku and Tachikawa) 

Consecutive pregnant women for first trimester 

Italy (Apulia) blood 

May 1-May 31 

screening or delivery in two hospitals 

Blood donors 18-65 years old free of recent 

donors 


symptoms possibly related to COVID-19, no 

China (Wuhan B)^^ 

March 26-April 

close contact with confirmed cases, symptoms 

free during the preceding 14 days, no contacts 

with suspected cases 

Age 16-64, going back to work, with no fever. 


28 

headache, or other symptoms of COVID-19 

California (San 

April 25-April 28 

U.S. census tract 022901 population-dense area 

Francisco)^^ 

(and n=40 in May) 

(58% Latinx) in San Franeisco Mission district. 

Brazil (Espirito Santo) 

May 13-15 

expanded to neighboring blocks on April 28 

Cross-sectional of major municipalities structured 

USA (six states)^^ 

March 23-April 1 

over houses as the sampling units 

Convenience samples using residual sera obtained 


(WA Puget Sound 

for routine clinical testing (screening or 


and NYC), April 

management) by two commercial laboratory 


6-10 (south 

companies 


Florida), April 20- 
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26 (MO), April 
20-May 3 (UT), 
April 26-May 3 


(CT) 

Spain^*’ April 27-May 11 


Louisiana (Orleans and May 9-15 
Jefferson Parish)^’ 


Belgium^^ March 30-April 5 

and April 20-26 

France (Crepy-en- April 28-30 

Valois)^^ 

China (several regions)'^®* March 30-April 

10 


35883 households selected from municipal rolls 
using two-stage random sampling stratified by 
province and municipality size, with all residents 
invited to participate (75.1% of all contacted 
individuals participated) 

Pool of potential participants reflective of the 
demographics of the Parishes was based on 50 
characteristics, then a randomized subset of 
150,000 was selected, then 25,000 were 
approached with digital aps, and 2640 recruited. 
Residual sera from ten private diagnostic 
laboratories in Belgium, with fixed numbers per 
age group, region and periodical sampling, and 
stratified by sex 

Pupils, their parents and relatives, and staff of 
primary schools exposed to SARS-CoV-2 in 
February and March 2020 in a city north of Paris 
Voluntary participation by public call for 
hemodialysis patients (n=979 in Zingzhou, Ubei 
and n=563 in Guangzhou/Foshun, Guangdong) 
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Brazil (Rio de Janeiro) April 14-27 

blood donors'^* (eligible: April 

24-27) 

Brazil (Sao Paulo)'^^ May 4-12 

Chile (Vitacura)'^^ May 4-19 


and outpatients in Chingqing (n=993), and 
community residents in Chengdu, Sichuan 
(n=9442), and required testing for factory workers 
in Guangzhou, Guandong (n=442) 

Blood donors could not have had flulike 
symptoms within the 30 days before donation; 
had close contact with suspected or confirmed 
covid-19 cases in the 30 days before donation; or 
traveled abroad in the past 30 days. 

Randomly selected adults and their cohabitants 
sampled form 6 districts of Sao Paulo City with 
high number of cases 

Classroom stratified sample of children and all 
staff in a community placed on quarantine after 
school outbreak 


Two of the studies included additional datasets of <500 participants that are not presented here (n=200 
blood donors in Oise and n=387 patients in the California (Bay Area) study) 

*not considered here are some sub-cohorts from this study, including healthcare workers, and staff from 
hotel for healthcare workers 
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Table 2. Sample size, types of antibodies, and population in relevant region 


Location 

Sample size 

Antibody 

Population in region* 

Population 





<70 years (%) 

Iran (Guilan)* 

551 

IgG/IgM 

2354848 

95 

Idaho (Boise)^ 

4856 

IgG 

481587 (Ada county) 

92 

Switzerland (Geneva) 

577 (4/20-27) 

IgG 

500000 

88 

Japan (Kobe)“ 

1000 

IgG 

1518870 

79 (Japan) 

Denmark blood donors*^ 

20640 

IgG/IgM 

5771876 

86 

France (Oise)^^ 

661 

IgG 

5978000 (Hauts-de-France) 

89 

China (Wuhan)'"^ 

1401 

IgG/IgM 

11080000 

93 (China) 

Netherlands blood 

7361 

IgG/IgM/IgA 

17097123 

86 

donors*^ 





Germany (Gangelt)*^ 

919 

IgG/IgA 

12597 

86 

Brazil (Rio Grande do 

4500 

IgG 

11377239 

91 

Sul)'’ 





Scotland blood donors'^ 

500 

IgG 

5400000 

88 

California (Santa Clara) 

3300 

IgG/IgM 

1928000 

90 

Luxembourg^® 

1807 

IgG/IgA** 

615729 

90 

Germany (Frankfurt)^' 

1000 

IgG 

2681000*** 

84 (Germany) 

California (Los 

863 

IgG/IgM 

7892000 

92 

Angeles)^^ 





New York^^ 

15101 

IgG 

19450000 

90 

California (Bay Area)^"' 

1000 

IgG 

7753000 

90 

Brazil (133 cities)^^ 

24995 

IgG/IgM 

74656499 

94 (Brazil) 
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Croatia^^ 

1494 

IgG/IgM 

4076000 

86 

New York (Brooklyn)^^ 

11092 

IgG 

2559903 

91 

Switzerland (Zurich)^^ 

1644 patients (4/1- 

IgG 

1520968 (canton Zurich) 

88 


15) 


1930525 (Zurich-i-Lucerne) 



1640 blood donors 





(May) 




Japan (Tokyo) 

1071 

IgG 

13902077 

79 (Japan) 

Spain (Barcelona)^° 

874 

IgG/IgM/IgA 

7566000 (Catalonia) 

86 

Italy (Apulia) blood 

909 

IgG/IgM 

4029000 

84 

donors 





China (Wuhan B)^^ 

1196 (4/4-8) 

IgG/IgM 

11080000 

93 (China) 

California (San 


IgG (also 


95 

Francisco)^^ 

3953 

PCR testing) 

5174 (census 022901) 


Brazil (Espirito Santo) 

4608 

IgG/IgM 

4018650 

94 (Brazil) 

USA (six states)^^ 


Pan-Ig 



WA Puget Sound 

3264 


4273548 

90 (WA) 

UT 

1132 


3282120 

92 

NYC 

2482 


9260870 

89 

MO 

1882 


6110800 

88 

South FL 

1742 


6345345 

86 (FL) 

CT 

1431 


3562989 

88 

Spain^*’ 

61075 

IgG 

46940000 

85 

Louisiana (Orleans and 

2640 

IgG 

825057 

92 (LA) 


Jefferson Parish) 


37 
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Belgium^^ 

3391 (4/20-26) 

IgG 

11589623 

86 

France (Crepy-en- 

1340 

IgG 

5978000 (Hauts-de-France) 

89 

Valois)^^ 





China (several regions)"^® 


IgG/IgM 



Hubei (not Wuhan) 

979 


48058000 

93 (China) 

Chongqing 

993 


31243200 

93 (China) 

Sichuan 

9442 


83750000 

93 (China) 

Guangdong 

1005 


115210000 

93 (China) 

Brazil (Rio de Janeiro) 

669 (4/24-27) 

IgG/IgM 

17264943 

94 (Brazil) 

blood donors'** *** 





Brazil (Sao Paulo)'*^ 

517 

IgG/IgM 

298240 (6 districts) 

94 (Brazil) 

Chile (Vitacura)'*^ 

1244 

IgG/IgM 

85000 

92 (Chile) 


*The authors of some studies preferred to focus on age-restricted populations: 17-70 years old in the 
Denmark hlood donor study (n=3800000), those 18-79 years old in the Luxembourg study (n=483000); 
those <70 years old in Netherlands hlood donor study (n=13745768); those >=18 years old in the New 
York state study (n=15280000); and those >19 years old in the UT population of the six states US study 
(n=2173082). 

**considered positive if both IgG and IgA were positive 

***participants were recruited from a large number of districts, but most districts had very few participants; 
here the population of the 9 districts with >1:10,000 sampling ratio is included (846/1000 participants came 
from these 9 districts). 
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Table 3. Prevalence of infection and estimated number of infected people 


Location 

Seroprevalence (%) 


Estimated 


Crude 

Adjusted (adjustments) 

infected 

Iran (Guilan)* 

22.0 

33.0 (test, sampling) 

770000 

Idaho (Boise)^ 

1.79 

ND 

8620 

Switzerland (Geneva) 

10.6 

10.9 (test, age, sex) 

54500 

Japan (Kobe)“ 

3.3 

2.7 (age, sex) 

40999 

Denmark blood donors*^ 

2.0 

1.9 (test) 

109665 

France (Oise)^^ 

25.9 

ND 

1548000 

China (Wuhan)'"^ 

10.0 

ND 

1108000 

Netherlands blood donors 

2.7 

ND 

461622 

Germany (Gangelt)*^ 

15.0 

20.0 (test, cluster, sym) 

2519 

Brazil (Rio Grande do Sul)^’ 

0.222 

0.222 (sampling)* 

25283 

Scotland blood donors^* 

1.2 

ND 

64800 

California (Santa Clara) 

1.5 

2.6 (test, sampling. 

51000 



cluster) 


Luxembourg^® 

1.9 

2.06 (age, sex, district) 

12684 

Germany (Frankfurt)^' 

0.6 

ND 

16086 

California (Los Angeles)^^ 

4.06 

4.65 (test, sex. 

367000 



race/ethnicity, income) 


New York^^ 

12.5 

14.0 (test, sex, age 

2723000 



race/ethnicity, region) 


California (Bay Area)^"^ 

0.4 (blood donors) 

0.1 (test/confirmation) 

7753 
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Brazil (133 cities)^^ 

1.39 

1.62 overall, varying from 

1209435** 



0 to 25.0 across 133 cities 




(test, design) 


Croatia^^ 

27 *** 

ND 

51765 

New York (Brooklyn)^^ 

47.0 

ND 

1203154 

Switzerland (Zurich)^^ 

unclear 

1.3 in patients in April 1- 

19773 (Zurich) 



15 and 1.6 in blood 

30888 



donors in May 

(Zurich+Luceme) 



(multivariate Gaussian 




conditioning) 


Japan (Tokyo) 

3.83 

ND 

532450 

Spain (Barcelona)^° 

14.3 

ND 

1081938 

Italy (Apulia) blood 

0.99 

ND 

39887 

donors 




China (Wuhan B)^^ 

8.36 (3.53 for entire 

ND (2.80 (age, gender. 

926288 


period) 

test) for entire period) 


California (San Francisco)^^ 

4.3 in the census track 

6.1 (age, sex. 

316 



race/ethnicity, test) 


Brazil (Espirito Santo) 

2.1 

ND 

84391 

USA (six states)^^ 


(age, sex, test) 


WA Puget Sound 

1.3 

1.13 

48291 

UT 

2.4 

2.18 

71550 

NYC 

5.7 

6.93 

641778 

MO 

2.9 

2.65 

161936 
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South FL 

2.2 

1.85 

117389 

CT 

4.9 

4.94 

176012 

Spain^*’ 

ND 

5.0**** (sampling, age, 

2347000 



sex, ineome) 


Louisiana (Orleans and 

6.9 (IgG or PCR) 

6.9 for IgG (eensus 

56578 

Jefferson Parish)^^ 


weighting, demographics) 


Belgium^^ 

5.7 

6.0 (sampling, age, sex. 

695377 



province) 


Franee (Crepy-en-Valois)^^ 

10.4 

ND 

620105 

China (several regions)"^® 




Hubei (not Wuhan) 

3.6 

ND 

1718110 

Chongqing 

3.8 

ND 

11956109 

Siehuan 

0.6 

ND 

487847 

Guangdong 

2.2 

ND 

2522010 

Brazil (Rio de Janeiro) 

6.0 

4.7 (age, sex, test) 

811452 

blood donors"^* 




Brazil (Sao Paulo)'^^ 

5.2 

4.7 (sampling design) 

14017 

Chile (Vitacura)"^^ 

11.2 

ND 

9500 


Among studies with multiple conseeutive time points where seroprevalence was evaluated, the 
seroprevalence estimate was the highest in the most recent time interval with the exception of Switzerland 
(Geneva) where the highest value was seen two weeks before the last time interval, Switzerland (Zurich) 
where the highest was seen in April 1-15 for patients at the university hospital and in May for hlood 
donors, and the China (Wuhan B) study where the highest value was seen about 3 weeks before the last 
time interval. 

*an estimate is also provided adjusting for test performance, but the assumed specificity of 99.0% seems 
inappropriate, since as part of the validation process the authors also found that several of the test-positive 
individuals had household members who were also infected, thus the estimated specificity was deemed to 
be at least 99.95% 
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**the authors calculate 760000 infected in the 90 cities that had 200-250 samples tested, hut many of the 
other 43 cities with <200 samples may he equally or ever better represented, since they tended to he smaller 
than the 90 (mean population 356213 versus 659326) 

***1.20% in workers in Split without mobility restrictions, 3.37% in workers in Knin without mobility 
restrictions, 1.57% for all workers without mobility restrictions; Split and Knin tended to have somewhat 
higher death rates than nation-wide Croatia, but residence of workers is not given, so the entire population 
of the country is used in the calculations 

**** 5.0% by point of care test, 4.6% with immunoassay, 3.7% by having both tests positive, 6.2% by 

having at least one test positive 

test: test performance; ND: no data available 
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Table 4. Inferred infection fatality rates 


Location 

COVID-19 

Inferred IFR 

COVID-19 

IFR in <70 


deaths (date) 

(corrected), % 

deaths <70 

years 




years,*** % 

(corrected), % 

Iran (Guilan)* 

617 (4/23) 

0.08 (0.07) 

No data 

No data 

Idaho (Boise)^ 

14 (4/24) 

0.16(0.13) 

14 (Idaho) 

0.02 (0.02) 

Switzerland (Geneva)'*’ 

243 (4/30) 

0.45 (0.36) 

8 

0.04 (0.03) 

Japan (Kobe)" 

10 (mid-April) 

0.02 (0.02) 

21 (Japan) 

0.01 (0.01) 

Denmark blood donors'^ 

370 (4/21) 

0.34 (0.27) 

12 

0.05 (0.04) 

France (Oise)'^ 

932 (4/7)# 

0.06 (0.05) 

7 (France, <65 

0.01 (0.01) 




years) 


China (Wuhan)'"* 

3869 (5/2) 

0.35 (0.31) 

50 

0.19(0.15) 

Netherlands blood 

3134 (4/15) 

0.68 (0.68) 

11 

0.09 (0.09) 

donors'^ 





Germany (Gangelt)'^ 

7 (4/15) 

0.28 (0.25) 

0 

0.00 (0.00) 

Brazil (Rio Grande do 

124 (5/14) 

0.49 (0.39) 

31 (Brazil, 

0.19(0.15) 

Sul)'’ 



<60 years) 


Scotland blood donors'^ 

47 (4/1) 

0.07 (0.06) 

9 (<65 years) 

0.01 (0.01) 

California (Santa Clara) 

94 (4/22) 

0.18(0.17) 

35 

0.07 (0.06) 

Luxembourg^" 

92 (5/2) 

0.73 (0.58) 

9 

0.07 (0.06) 

Germany (Frankfurt)^' 

42*(4/17) 

0.26 (0.21) 

14 (Germany) 

0.04 (0.03) 

California (Los 

724 (4/19) 

0.20 (0.18) 

24 (<65 years) 

0.06 (0.05) 

Angeles)^^ 





New York^^ 

18610(4/30)^ 

0.68 (0.54)^ 

34 

0.26 (0.23)^ 
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California (Bay Area)^"^ 

12 (3/22) 

0.15(0.12) 

25 

0.04 (0.03) 

Brazil (133 cities)^^ 

** 

Median 0.30 (0.27) 

31 (<60 years) 

0.10(0.9) 

Croatia^^ 

79 (5/3) 

0.15(0.14) 

13 

0.02 (0.02) 

New York (Brooklyn)^^ 

4894 (5/19)^ 

0.41 (0.33)^ 

34 (NY State) 

0.15 (0.14)^ 

Switzerland (Zurich)^^ 

107 (4/15 Zurich), 

0.51 (0.41) 

8 (Switzerland) 

0.05 (0.04) 


147 (5/22, 





Zurich+Lucerne) 




Japan (Tokyo) 

189 (5/11) 

0.04 (0.03) 

21 (Japan) 

0.01 (0.01) 

Spain (Barcelona)^® 

5137 (5/2) 

0.48 (0.48) 

13 (Spain) 

0.07 (0.07) 

Italy (Apulia) blood 

530 (5/22) 

1.33 (1.20) 

15 (Italy) 

0.24 (0.22) 

donors 





China (Wuhan B)^^ 

3869 (4/13) 

0.42 (0.38) 

50 

0.23 (0.21) 

California (San 

0 (5/4) 

0.00 (0.00) 

0 

0.00 (0.00) 

Francisco)^^ 





Brazil (Espirito Santo) 

363 (5/21) 

0.43 (0.39) 

31 (Brazil, 

0.14(0.13) 




<60 years) 


USA (six states)^^ 





WA Puget Sound 

207 (4/4) 

0.43 (0.43) 

10 (WA, <60) 

0.05 (0.05) 

UT 

58 (5/4) 

0.08 (0.08) 

28 (<65) 

0.03 (0.03) 

NYC 

4146 (4/4) 

0.65 (0.65) 

34 (NY state) 

0.25 (0.25) 

MO 

329 (4/30) 

0.20 (0.20) 

23 

0.05 (0.05) 

South FL 

295 (4/15) 

0.25 (0.25) 

28 (FL) 

0.08 (0.08) 

CT 

2718(5/6) 

1.54(1.54) 

18 

0.31 (0.31) 
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Spain^* ** *** ’ 

26920 (5/11) 

1.15(0.92) 

13 

0.18(0.14) 

Louisiana^^ 

925 (5/16) 

1.63(1.31) 

32 

0.57 (0.46) 

Belgium^^ 

7594 (4/30) 

1.09 (0.87) 

10 

0.13(0.10) 

France (Crepy-en- 

2325 (5/5)# 

0.37 (0.30) 

7 (France, <65 

0.04 (0.03) 

Valois)^^ 



years) 


China (several regions)"^® 





Hubei (not Wuhan) 

643 (4/12) 

0.04 (0.03) 

About 50? (if 

0.02 (0.02) 

Chongqing 

6 (4/12) 

0.00 (0.00) 

similar to 

0.00 (0.00) 

Sichuan 

3 (4/12) 

0.00 (0.00) 

Wuhan) 

0.00 (0.00) 

Guangdong 

8 (4/12) 

0.00 (0.00) 


0.00 (0.00) 

Brazil (Rio de Janeiro) 

1019 (5/3) 

0.12(0.11) 

31 (Brazil, 

0.04 (0.04) 

blood donors'^* 



<60 years) 


Brazil (Sao Paulo)"^^**** 

Unknown (5/15) 

Unknown, but 

31 (Brazil, 

Unknown, but 



likely >0.4 

<60 years) 

likely >0.1 

Chile (Vitacura)"^^**** 

Unknown (5/18) 

Unknown, but 

36 (Chile) 

Unknown, but 



likely <0.2 


likely <0.1 


* approximated from number of deaths in the Hesse province on 4/17 times the proportion of deaths in the 9 
districts with key enrollment in the study among all Hesse province deaths. 

**data are provided by the authors for deaths per 100,000 population in each city along with inferred IFR 
in each city, with wide differences across cities; the IFR shown here is the median across the 36 cities with 
200-250 samples and at least one positive sample (the interquartile range for the uncorrected IFR is 0.20- 
0.60 and the full range across all cities is 0-2.4%, but with very wide uncertainty in each city). A higher 
IFR is alluded in the preprint, but the preprint also shows a scatter diagram for survey-based seroprevalence 
versus reported deaths per population with a regression slope that agrees with IFR of 0.3. 

*** whenever the number/proportion of COVID-19 deaths at age <70 years was not provided in the paper, 
the proportion of these deaths was retrieved from situational reports of the relevant location; when this was 
not possible to find for the specific location, a larger geographic entity was used. For Brazil, the closest 
information that could be found was from a news report 

(https://www.theiakartapost.eom/news/2020/05/22/in-brazil-covid-19-hitting-voung-people-harder.html) . 

For Croatia, data on age for 45/103 deaths were retrieved through Wikipedia. 

**** Information on deaths not available for the specific locations. In the Sao Paulo study, the 6 districts 
were selected as being the most hit of Sao Paulo, but it is not stated which are the districts and one cannot 
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retrieve what the numher of deaths was as of mid-May, hut even using data for death rates across all Sao 
Paulo would lead to IFR >0.4 overall. In the Vitacura study, similarly one can infer from the wider 
Santiago Metropolitan area that in the Vitacura area the IFR would prohahly he <0.2 overall. 

^Confirmed deaths; inclusion of prohahle deaths would increase the IFR estimates hy about a quarter. 

#for France, governmental situational reports provide per region the numher of deaths only for in-hospital 
deaths, therefore the numher of in-hospital deaths has been multiplied by a factor equal to the total 
deaths/in-hospital deaths across France. 

IFR: infection fatality rate. The inferred IFR is derived by dividing the number of accumulated deaths (at 
the time chosen by the authors of each study, or until after 1 week of the mid-point of the study dates, 
whenever the authors had not arbitrated on a date for the death count) by the estimated number of infected 
people. The corrected IFR is obtained from the inferred IFR assuming that, as compared with assessing 
IgG, IgM, and IgA, 20% of the infections are missed when only IgG is assessed, and 10% of the infections 
are missed when two of the three antibodies are assessed. 
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Table 5. Additional seroprevalence data from nation-wide studies that have been announced 
to the press and/or in preliminary reports, but are not presented yet as full articles. Only 
countries not represented in Tables 1-4 are eonsidered. 


Country 

Sample size 

(antibody) 

Date 

Seroprevalence 

(%) 

Population 

Deaths (date) 

IFR (corrected) 

United 

Kingdom"^'* 

885 (IgG) 

April 26-May 

24^ 

6.78 

67897000 

34636 (5/17) 

0.75 (0.60) 

Finland'*^ 

674 (IgG) 

April 20-26*^ 

2.52 

5541000 

211 (4/30) 

0.15 (0.12) 

Sweden'*® 

1200 (IgG) 

May 18-24 

6.3 

10101000 

4501 (5/28) 

0.71 (0.57) 

Czechia'*^ 

26549 (IgG) 

April 23-May 1 

0.40 

10710000 

252 (5/4) 

0.59 (0.47) 

IsraeU® 

1709 (IgG?) 

May 2020 

2-3 

9198000 

299 (6/10) 

0.13 (0.10)* 

India'*® 

26400 (IgG?) 

May 2020 

0.73** 

1380382000 

8107 (6/10) 

0.08 (0.06) 

Slovenia®** 

1368 (IgG?) 

April 2020 

3.1 

2079000 

92 (5/1) 

0.14(0.11) 


^ slightly lower seroprevalenee reeorded in subsequent weeks 

*Assuming seroprevalenee 2.5%. Another ongoing study in Beni Brak (population n=198900, 0 
deaths as of mid-June) has shown seroprevalenee of 1.4% among the first 2933 samples. A much 
larger seroprevalence study based on HMOs has also been launched. 

**based on data from part of the sample (65 of the 83 distriets) 
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Figure legends 

Figure 1. Locations that had two or more IFR estimates from different studies. Locations 
are defined at the level of countries, except for the USA where they are defined at the level 
of states and China is separated into Wuhan and non-Wuhan areas. Corrected IFR 
estimates are shown. Within the same location, IFR estimates tend to have only modest 
differences, even though it is possible that different areas within the same location may 
also have genuinely different IFR. France is one exception where differences are large, but 
both estimates come from population studies of outbreaks from schools and thus may not 
provide good estimates of population seroprevalence and may lead to underestimated IFR. 
Figure 2. Scatterplot of corrected IFR estimates (%) in each location plotted against the 
COVID-19 mortality rate as of July 12, 2020 in that location (in deaths per million 
population). Locations are defined at the level of countries, except for the USA where they 
are defined at the level of states and China is separated into Wuhan and non-Wuhan areas. 
When several IFR estimates are available from multiple studies for a location, the sample 
size-weighted mean has been used. Not shown are two locations with >1000 deaths per 
million population, both of which have high IFR (New York and Connecticut). 
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Figure 2 



Population COVID-19 mortality (deaths per million) 
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